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LNTRBOOUCTION 


In theorys terminal input to Multics is converted by the 
ring~zero typewriter DIM to "canonical form", i. esr the physical 
appearance of a line uniquely defines the form in which it will 
be stored. IN additions well-defined meanings are attached to 
input Streams containing erases kills and escape characters, - 


In actual facts the current typewriter DIM does not meet the 
goals described in the preceding paragraph. The three basic types 
of canonicalization (column assignments erase/kills» and escape) 
ate each handled more or less correctlyse but the current design 
does not tend itself to correct and consistent processing of 
combinations of canonicalization types. The trouble is that the 
three types are handled more or less simultaneously. Thus the 
final input resulting from strings such as "\027", "“\016#7", 
"NOOO", "“HO25", etc.wr 18 not predictable under’ the current 
implementation. 


A redesiaqneds, more efficient version of tty_read is planned. 
for Multics release 4.0- in the course of the new designe 
canonicalization will be cleaned up and made consistent. The 
details of this new design will be discussed in a future MIBs the 
purpose of the present document is to set forth a complete 
description of the rules of canonicatization that the new 
tty_read will implement. It is proposed that the rules described 
here be adopted as a standard for all situations in Multics where 
Canonicalization is required. 


CANONICALIZATION RULES 


The three types of canonicalization named above must be 
performed separately in a defined orderr to ensure consistency 
and predictability. In particulars the canonicalization process 
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is conceptually divided into the following steps: 
ee the terminal is in “can” mode» perform 


column-assignment canonicalization on the typed input. 


2. lf the terminal is in "“erkl" modes perform erase/kill | 
canonicalization on the result of step 1. 


3. If the terminal is in "esc™ modes perform escape 


canonicalization on the result of step 2. 


Of courses the actual implementation does not necessarily have to 
perform the three steps in sequences provided that the result is 
the same as would have been achieved by doing so. 


The three types of canonicalization are discussed in more 
detail below. lf two or more of the rules Listed below are 
applicable to a given input strings they are applied in the order 
in which they are presented here. 


COLUAN ASSIGNMENT 


This phase 1s concerned with determining which printing 
graphicse if anys appear in each physical column position. This 
is determined according to the following rules. 


Rules_For_the_ Interacetation of Inout Characters 
1. The Leftmost position of the carriage is considered to be 
column 1, 


2. Each printing graphic or space typed increases the column 
position by 1. 


3. Each backspace typed decreases the column position by 1 
unless the column position is 1. 


4. A carriage return sets the column position to 1. 


S. A horizontal tab increases the column position to the 
next tab stops tab stops are defined to be at columns 11, 
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214 314 etcs 


6. A newline, form feeds or vertical tab sets the column 
position to 1 and advances the carriage vertically: thus 
no character typed after such a character can share a 
column position with a character typed before it. 


Rules_for_the Formation of the Canonical String 


7. Characters on each Line are sorted so that their 
associated column positions are monotone increasing. 


8. No carriage return characters may appear in the canonical 
string. 


9. A horizontal tab is preserved as typed unless a printing 
graphic appears in one of the columns skipped by the tabs 
in which ‘case the tab 1s replaced by an appropriate 
number of spaceS. . 


10. Backspaces appear in the canonical string only when two 
or more printing graphics share a column position. 


11. When two or more different printing graphics share a 
“column positions, the characters are sorted as follows: 
graphic with lowest mumeric ASCII codes backspace, 
graphic with next lowest numeric ASC11 codes etc. 


12. If the contents of a column position consist of two or 
more instances of the same printing graphics that column 
is.reduced to a single instance of the graphic. © 


13. A Lline-ending character (newlines form feeds or vertical 
tab) immediately follows the last printing graphic in the 
rightmost column position on the line. 


ERASE AND KILL CHARACTERS 


The placement o f erase/kill canonicalization after 
column-assignment canonicalization and before escape 
canonicalization is strategic in that it causes” erase/kill 
processing to work by cQlumo aasition rather than by Character. 
This eliminates ambiguity with respect to erase Characters 
combined with escape sequences. (See the examples at the end of 
this document, ) 
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The rules for erase and kill canonicalization are given 
below. 


14. An erase character alone ina column position results in 
the detetion of itself and. of the contents of the 
preceding column position. . 


15.2 An erase character alone in a column position and 
preceded by more than one blank column results in the 
deletion of all immediately preceding blank columnss = as 
well as of the erase character. 


16. An erase character sharing a column position with one or 
more printing graphics results in the deletion of the 
contents of that column position. 


17. A kill character results in the deletion of its own 
column position and alt column positions to.its lefts 
unless it shares a calumn position with = an erase 
characters» in which case rule 16 appties (Cthe kilt 
character is erased). 


18. If the terminal is in "esc" moder an erase or kill 
character alone in a column immediately preceded by an 
escape character alone in a column is not processed as an 
erase or kill character, 


Note that for rule 18 to apply» the erase or kill character must 
actually have been typed in the column immediately following the 
escape character. The reason for this is that it facilitates the 
erasing of escape sequencess e.g.r \OOLFHHH. 


ESCAPE SEQUENCES 


The processing of escape sequences is performed according to 
the rules given below. 


19. An escape sequence consists of an escape character alone 
in its column position followed by one or more printing 
graphics each of which is alone in its column position. 
An escape sequence is replaced by a single character in 
the canonical string. 


20. An escape sequence consisting of two successive escape 
characters is replaced by an eScape character. 
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An escape sequence consisting of an escape character 
followed by an erase (or kill) character is replaced by 
an erase for kill) character. 


An escape sequence consisting of an escape character 
followed by ones twor or three octal digits is replaced 


‘by the character whose ASCII value is represented by the 


sequence of octal digits. 


An escape character followed by a newline character 
results in the deletion of both characters from the 
canonical string. - 


Other escape sequences may de> defined on a 
per-terminal-type basis» where such a sequence consists 
of an escape character and one character following. 


i ; 

If the character following an escape character does not 
result in an escape sequence as defined by rules 20-24, 
the escape and foilowing characters are stored as they 


appear on the line. 


EXAMPLES. 


are 


In the examples belowse the following conventions. are used: 


<NL> ““Ppepresents a newline 


~ €CR> represents 2 carriage return 
<BS> represents a backspace 
<HI> — represents a horizontal tab 
<SP> represents a space 
{nnn} represents a character whose ASCII value is 


nnn (octal) 


\ 1s the escape character 
# 1s the erase character 
a is the kill character 


The examples in the first group illustrate how various typed 
sequences are canonicalized in terms of column positions these 
followed by examples of erases kill» and escape 
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canonicalization. In the second groups Lines are shown as they 
appear physically» with no consideration given to the precise 
sequence of keystrokes that might have produced them, 


COLUMN CANONICALIZATION EXAMPLES 


Example 1 


lyped: Nothing special about this Line.<NL> 


Appearance: Nothing special about this line. 


Results Nothing special about this lLine.<NL> 
Example e 
Typed: Extraneous white s<SP><BS>pace is ignored.<CR><SP><NL> 


Appearances: Extraneous white space 1S ignored. 


Results: Extraneous white space is ignored.<NL> 
Example_3 
Typed: Two ways (2<BS>_) to overstrike.<CR>___<NL> 


Appearance: Jwo ways (2) to overstrike. 


Result: T<BS>__L<BS>w_<BS>o0 ways (2<BS>_) to overstrike.<NL> 
Exampole_4 

Typed: Tab + backspace is<HT><BS>reduced to spaces.<NL> 
Appearances: Tab + backspace is reduced to spaces. 


Results: Tab + backspace 1s<SP><SP><SP><SP>reduced to spaces.<NL> 
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(See rule 9.) 


ERASE-KILL AND ESCAPE EXAMPLES 


Examole 5 


Appearance: abz4cde 


Results abcde 
Examole_6 

Appearance: ab acde 
Results abcde 
Examole 7? 


Appearances NotadNever obBn Sunday. 


Results Never on Sunday. 


Example 8 


Appearances Nox#w it's right. 


Results Now it's right. 


Examole 9 


Appearance: Nox#w it*s right. 
Result: . Noxw it's right. 


(Erase character is overstrucks see Rule 16.) 
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Example 10 


~ 


Appearances: dcl errs char (1) static initC*\OI7#6"%), 


Result: del rrs char (1) static init("{016)"); 


Example_ll 


Appearances \023 
Result: {002}3 


(Overstruck 3 18 not part of escape sequence.) 


Examole_le 


Appearances: \11l2 
Results N112 


COverstruck \V 18 not an escape character.) 


Example 13 


Appearances: a\#ab 
Results a\b 


(First # 1S not an erase character by rule 18% second & erases 
itself and preceding 4 by rule 14.) 


Example_14 (similar to txample 13) 


Appearances a\adAlb 


Results a\b 
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Example 15 


Aopearance: aH#ab 
Results b 


(The \ is erased by the overstruck #.) 


Example 16 


Appearances: a\\#b 
Results: a\#b 
(Erase canonicalization does not recognize the # by rule 18, 


escape canonicatization recognizes \\ by rule 20.4 and attaches no 
special meaning to the #.) 


Example 12 


Appearance: a\\##b 
Result: a\b 
(By rule 18.4 the first # is not an erase characters by rule 14, 


the second # erases itself and the preceding #- then rute 20 
reduces \\ to \.) 


Example 18 


Appearances: a\\A##b 
Result: a\b 


(The first # is not an eraser the next two arer erasing the 
second \ and the first #.) 
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Example_19 


Appearance: aN\\AHHAdb 
Result: ‘ab 
(The first 4 15 not an eras@s and must be erased before the two \ 


characters. Examples 16-19 illustrate the difficulty of erasing a 
double \s the clearest method is probably to overstrike (aN¥Nb).) 


Example. 20 (on 2741-like terminal) 


Appearances: at<fb 
Results a\b 


(Only the < is eraseds € 18 translated to \.)_ 


